Method, system and computer programme for the generation of an electronic document from scanned image data

ABSTRACT

In a method and system for display of a screen-aided generated of an electronic document from raster scan image data in an application computer, the image data are filed in at least one electronic storage. A first group of image data are provided stored page-by-page and comprises a plurality of pages and a second group of overlay image data are provided in a relationship associated such that one page is associated with at least one overlay and one overlay being associated with at least one page. In a first display screen region, symbols of the first group are displayed per page in an ordered structure. In a second display screen region, image data of the first group are displayed. The document is processed by an editor computer program loaded on the application computer.

[0001] The invention concerns a method, a print system, a computer system and a computer program to generate an electronic document from scanned image data.

[0002] The invention in particular concerns a system in which electronic documents are assembled page by page from scanned image data. The image data can, for example, be generated in that samples are sampled (scanned) and the image data thereby generated are filed page by page and pixel by pixel in an electronic storage. However, the data can just as well be generated completely electronically, i.e. be generated and/or scanned on an electronic device such as, for example, computer.

[0003] A method and a system to process structured images in electronically stored documents is known from U.S. Pat. No. 5,666,503.

[0004] Raster scan editor programs are known for processing of documents that exist as pages in scanned image formats. Such a program is, for example, available from the form Techsoft AS, Sandefjord, Norway under the trade name Pixedit®.

[0005] In the publication “The World of Printer”, Dr. Gerd Goldmann (Hsg.). 5th edition (June 2000), Océ Printing Systems GmbH (Poing), ISBN 3-00-001081-5, what is known as a PRISMApro Server System is specified in chapter 11, that serves to generate electronic documents and to prepare the documents for a high-speed digital printing systems [sic]. A method and a system to impose print data is specified in WO 00.68877 A1 that is suitable for preparation of print data for such a high-speed print system.

[0006] A print device to print overlay formats is known from U.S. Pat. No. 4,944,614. Repetitive fixed image data and variable image data are stored in a storage. Furthermore, the print device comprises a format storage in which image data are stored corresponding to a formula for each document page. The image data of the format storage, the fixed image data as well as the variable image data, are combined with one another for printing.

[0007] The documents WO 00/68877 by the same applicant as well as JP 07182321 A and U.S. Pat. No. 5,715,382 are referred to as further prior art.

[0008] It is the object of the present invention to achieve a secure and fast processing of scanned image data given the compilation of an electronic document.

[0009] This object is achieved with the invention specified in the independent claims. Advantageous embodiments of the invention are the subject matter of the subclaims.

[0010] According to a first aspect of the invention, a first group of image data that comprises a plurality of pages is associated with a second image data group, such that the first group is stored in the storage independent of the second group or is transferred to a storage. Both groups of image data are superimposed in a display or in a printing of an individual page of the document. This superimposition can in particular ensue in a particular storage. The second group of image data can in particular be a plurality of what are known as overlays, that are respectively assembled such that they are to be positioned within a single page of the respectively first group. The overlays thereby respectively comprise information that is positioned within a first page of the first image data, for example a text passage, an image, or a logo. The overlay data can furthermore in particular be associated with size information and/or position information that, in the superimposition, respectively establish the exact position and/or size of the overlay in the first image data or, respectively, within the page.

[0011] According to an advantageous exemplary embodiment of the first aspect of the invention, the logical association of the image information between an overlay and the corresponding image data of the first group ensues according to the relation 1:n and/or according to the relation m:1, whereby n and m are respectively whole numbers greater than 2.

[0012] According to a further aspect of the invention that can be implemented in combination with or also independent of the first aspect of the invention, a first region is provided on a display device (for example a display screen) for processing the document, in which are shown in a tree-like structure the respective file names of the respective image data (what is known as a tree structure area. In a second area (what is known as a viewing area) of the display device, the appertaining image data can be displayed in detail. The respective display is effected via computer control. In the first, tree-like display structure, it is in particular provided to display additional information, at least given contact of the display file name with a selection means such as a mouse pointer. As additional information, it can in particular be provided in a scaled-down special window (what is known as a thumbnail) to display the image data and/or to indicate information about associated overlay data and/or about associated special functions such as, for example, automated programs (batches or macros).

[0013] Furthermore, it can thereby in particular be provided to effect the display of the image data in the second display region (viewing region) only when the corresponding page is specifically selected in the tree structure region, for example by double-clicking with a mouse. The display in the viewing region ensues in particular with a high resolution, such that the image information is easily recognizable given observation with the human eye from an average viewing distance (approximately 25 cm).

[0014] According to a further aspect of the invention, that in turn can be implemented independently of the previously cited aspects of the invention, the document processing system is connected with a document production control system. The connection ensues in particular such that overlay information that is stored in the document production system can be directly received by the print production control system, whereby overlay data must only be transmitted once to the print production control system, and the association of the overlay data with the respectively paged data of the document ensues via referencing that likewise must only be transmitted once. The transmission speed from the document generation system or, respectively, processing system to the print production system or, respectively, to the eventual print device implementing the print event is then possible with higher speed and the least data volume given complete information reproduction.

[0015] In a further preferred embodiment of the latter cited aspect of the invention, printspecific data can be transferred with the start of a print job, such as for example page composition data that comprise the output via simplex-printing, duplex-printing or the like. Additionally, further print production data can also be transmitted that are relevant for generation of a document, for example specifications about the further processing of printed pages for stapling, binding, or the like.

[0016] In a further aspect of the invention, that can in turn be implemented independently of the previously cited aspects of the invention, a plurality of pages or, respectively, their image data are respectively combined into a range and, within the range of image data or, respectively, of the range, form a subrange of image data. The respective ranges and subranges are then associated with main processing processes or, respectively, subordinate processing processes that can be implemented at a specific command, such that the main processes only effect the ranges and the subordinate processes only effect the subranges. Such processing processes can, for example, be the automatic numbering of pages or the removal of image distortions.

[0017] Further aspects and embodiments of the invention are subsequently specified, from which additional advantages of the invention are made apparent. Thereby shown are:

[0018]FIG. 1: a document generation system and printing system,

[0019]FIG. 2: first system components,

[0020]FIG. 3: second system components,

[0021]FIG. 4: third system components,

[0022]FIG. 5: a document processing menu,

[0023]FIG. 6: various document tree structures,

[0024]FIG. 7: tree structures with markings,

[0025]FIG. 8: a selection menu,

[0026]FIG. 9: an overlay structure,

[0027]FIG. 10: the schema of a data transmission,

[0028]FIG. 11: a selection menu for association of overlays,

[0029]FIG. 12: a selection menu for association of automatic processes,

[0030]FIG. 13: a selection menu for a print monitoring system,

[0031]FIG. 14: a selection menu for print production parameters.

[0032]FIG. 1 shows a printing system in which electronic documents are transmitted from a print data source 1 to a print server 2. The print data source 1 can, for example, be a host computer in which the documents exist in a print data language such as Postscript, Page Command Language (PCL), Advanced Function Presentation (AFP) or the like. The data thus transmitted are screened in the print server 2, i.e. converted page by page into per-pixel data (bitmap data). Data formats such as Graphics Interchange Format (GIF), Taped [sic] Image File Format (TIFF), or other bitmap data can thereby be generated. In place of a host computer, the data source 1 can also be a scanner that scans the physically present documents, for example paper pages, and generates from them electronic data that are then transmitted as bitmap data to the print server 2.

[0033] An application computer 3 (personal computer, PC) at which the scanned, paginated document data stored in the print server 2 can be processed is connected with the print server 2.

[0034] The thusly processed document data are then edited in the print server 2 such that they are suitable for output to a printer 4. The connection between the individual devices 1, 2, 3 and 4 can thereby alternatively be either individual direct connections via data lines or also network connections via a Local Area Network (LAN) or a Wide Area Network (WAN) such as, for example, the Internet.

[0035] User PC 3, print server 2 and printer 4 are once again shown in FIG. 2, whereby various system modules, i.e. software programs installed on the respective devices, as well as their functional connections, are shown.

[0036] The central software module in the user PC 3 is a document editor 10 that, on the one hand, comprises an editing module 11 and, on the other hand, comprises a graphical user interface 12 with which processed documents can be transmitted to further processing modules, such as to an imposition module 13 (Profession Document Composer, PDC) or an external display program (Professional Document Viewer, PDV).

[0037] A method and a system for imposition of print data that can run in a corresponding imposition module 13 is, for example, specified in WO 00/68877 A1. Also, in the previously cited book “The World of Printers” (ISBN 3-00-001081-5), on the pages 11-15 through 11-18 under the title “PRISMA+PoD”, further details of the Professional Document Composer 13 are specified. The content of both of these publications are hereby included by reference in the present specification.

[0038] The document editor 10 can also cooperate directly with a scan module 15, with which a scanner (scanning device) known to it can be operated or, respectively, its data can be directly input into the document editor 10.

[0039] A print output module 16 additionally runs on the user PC, with which data from the document editor are edited such that they can be output print-adapted via the print server 2 to the printer 4 with higher speed. This print job manager module 16 (PJM) is connected with a job processing system (order distribution system, ODS). Both modules are also installed in the print server 2, such that a connection is enabled between user PC 3 and print server 2. Furthermore, a language converter is comprised in print server 2 that converts the print data for output at the printer 4. On an arbitrary client console (Client PC 13) connected to a network, the print job manager 16 generates for each print job what is known as a job ticket and transmits the job tickert [sic] via the network to the order distribution system. It is possible by means of the PJM 16 to generate various resources (for example print data for reprints, job tickets, overlay marks, variable data sets) of an AFP data stream (advanced variable data stream) in the system, and to deposit them on the system. Moreover, the satus [sic] of the job in the system can also be monitored.

[0040] Further details of the print job manager module 16 and of the order distribution system cooperating with it are specified later in connection with FIGS. 13 and 14. Further details of the print job manager module 16 are also specified in the previously cited publication “The World of Printers [sic] (ISBN 3-00-001081-5) in chapter 11 (in particular on the pages 11-11 through 11-15). This publication is hereby likewise included by reference in the present specification.

[0041] The system components are shown in somewhat more detail in FIG. 3, whereby the respective data formats that can be processed are also specified. On the input side, various modules are provided in which the respective input data are either directly or incrementally converted to the TIFF format. Via a first data input, standard print languages or, respectively, standard print formats such as Postscript (PS), Printer Control Language (PCL) or Portable Document Format (PDF) are supplied to a language converter 20 that converts the data into the data format Image Object Content Architecture (IOCA). These data are then supplied to a TIFF data converter 21 that forms Tagged Image File Data (TIFF) from the IOCA data. Additionally, TIFF data can be directly input via a TIFF input module 22 or via the scanning module 15 already previously cited into the document editor 10.

[0042] The TIFF data input at the data input 23 are converted in a converter module 25 into an internal (temporary) data format (TDF) in the editor module 11. This module can, on the other hand, in turn generate IOCA data from the internal data format TDF, as well as parameters, that are transferred to the job submission module 12. To print out data from the document editor 10, the possibility exists either to transfer data via the output interface 26 to traditional printer drivers 27, or to transfer the IOCA data at the data output 24 to the print job manager module 16 of the print production module 18. Within the print production module 18, various modules are provided with which the document data are either preprocessed for output to a printer or for reproduction at another application program. Among these are the professional document composer 13 (with which an imposition schema and other parameters necessary for imposition can be determined), the print job manager 16 (with which the job data are administrated), and a print file manager (PFM 28) (that administers the various files of the document). With an output converter 29, the data are finally converted such that they can be further processed by an interface 17 for a user program, or by an interface 19 for printers.

[0043] Based on the editor module 11, the possibility exists to direct processed document data out via two different output channels. On the one hand, it can ensue via the output interface to the job submission module 12 specified above. On the other hand, it can ensue via the output interface 26 to the output drivers 27. It can thereby be provided to provide an output driver for the starting of a printer 14, or an output driver to output the document data in a file (print to file) 30.

[0044] The process to generate a print job is shown in FIG. 4. Within the document editor 10, it is provided for this to start the job generation module 12 via a function call from the editing module 11. Via this function call, job parameters such as the job name and further details that are specified later, as well as IOCA data from the editing module 11, are transferred to the job generation module 12. Furthermore, within the job generation module 12 thereby called, user-specific specifications are queried via an input window (see FIG. 14) before the job is transferred in the form of IOCA data or a special job file (Job Ticket File, *.TIC) to the print job manager or, respectively, the order distribution system 16.

[0045]FIG. 5 shows the basic processing window 35, which is brought by the editing module 11 to the graphical user interface (GUI) of the user PC 3, for example for display on a monitor.

[0046] The entire display menu is subdivided with push buttons into display surfaces and into function regions. In a first display region 44, all files stored on a fixed storage region (for example, within a defined directory on a hard drive) that should be merged into a document are displayed in a tree-like structure. For example, the files <127><140> are respectively individual pages of the document “1d3-ein.ps”. The page <127> is thereby displayed in the display region 43 at a resolution of approximately 200 dpi, which is easily recognizable with the human eye. Depending on the resolution capability of the display device (screen), the display can be controlled up to the original resolution of the document of, for example, 300 or 600 dpi. The precise position of the information relative to presented, predetermined page borders can be recognized via rulers 45 and 46, whose reading is facilitated via a crosshair 47.

[0047] Various functions and sub-menus can be called with the main menu row, and with the pushbuttons 37 standard functions (tools) can be called per push of a button. In the region 38 (Tools Style Bar), special adjustments for stylized objects can be effected. In the overview window 39 is, respectively, the activated page, here page <127> completely displayed and marked (which is visible currently in the detail display region 44). It is visible in the layer control region 40 whether the current displayed page has different, superimposed layers. Each of the layers is displayed in a freely selectable color, whereby the corresponding colors are also visible per push of a button in the detail display region 44 and in the overview window 39.

[0048] In order to facilitate the navigation within the entire document in the tree structure display region 41, a supplementary window 42 (what is known as a thumbnail) is dynamically opened for an individual page. Similar to the overview window 49 for the current completely displayed page, in the thumbnail 42 the respective page is shown raster scanned in scaled down depiction, for example at approximately only 50 dpi (with reference to the original size or, respectively, to the complete raster scan information of the original document). The thumbnail 42 appears automatically and dynamically as soon as the corresponding page number is contacted with the mouse pointer 50, in the example of FIG. 5 the page <131>. Depending on control execution, further information of the respective page in addition to the document content are shown in the thumbnail display 42, for example the page size, its disc space and/or the resolution of the pixels of the stored document in dots per inch (dpi). The thumbnail function is quite helpful in order to wander through the tree structure and to quickly locate specific pages.

[0049] Starting from the main processing window 35, diverse sub-menu windows can be called (see FIGS. 8, 11, and 12).

[0050] Within the toolbar pushbutton 37, the production print display of FIG. 14, via which the input of parameters ensues to control the print job manager or, respectively, the order distribution system module 16, can be called with the pushbutton 48. The overlay positioning push button of the FIG. 11, via which the association between overlays and page data ensues, can be activated with the push button 49. The menu of FIG. 12, in which the regional processing of automatic functions for automatic processing of image data ensues, can be called via the push button 51 (Range Batch Processing). Which page is currently displayed within the detail display 44 is displayed in the tree-display region 41 with the identification symbol 52 (a glasses symbol).

[0051]FIG. 6a again shows a tree structure 16 which is displayed in the tree structure display region 41. Chapters (chapter 1, chapter 2) of a document and corresponding sub-chapters (chapter 2-1, chapter 2-2) are thereby respectively displayed in indented, structured form, and under them the respectively appertaining individual pages are listed. FIG. 6b shows a modified form in which, respective to a structured unit (a previously defined region), appertaining overlay information and/or automatic action information (batch information) are displayed. It is recognizable in the example shown in 6 b that all pages of chapter 1 are connected with the overlay “OL1” and, for example, additionally page <2> is connected/superimposed with the overlay “OL2”, as well as processed by an automatic batch “BA3”. Similar to the previously specified thumbnail display in the page region, a thumbnail display 61 is also dynamically provided for overlays or, respectively, batches; they are displayed as soon as the corresponding overlay display or, respectively, batch display is contacted with the mouse pointer 50. Either the respective overlay information or a function specification (for example specal removal) of the corresponding batch is thereby displayed. By clicking the mouse on the symbol of an individual page within the tree structure 60, this page is displayed in the detail display region 44 and is thus accessible for further processing steps with regard to pixel manipulation and page processing. The corresponding page is indicated within the tree structure 60 by the identification symbol 52.

[0052] A corresponding display can also ensue for entire page ranges. Entire page ranges are thereby either marked with the mouse pointer 50 or via corresponding key functions on the keyboard (Shift-key and Control-key as well as corresponding arrow-movement keys). For such selected ranges, only respectively the first or the last page of the range is displayed in the detail display region 44. All of functions (operations) assigned to the range are applied to all of the pages. When a page range is selected, such functions that can not be applied to an entire range (for example the pixel manipulation of an object, which is only available in an individual page) are then not offered for selection or, respectively, inaccessible.

[0053] Within a document or a region of the document, the logical sequence of the pages can be changed. Individual pages, page groups, or page ranges can be shifted to different positions within the document. The physical data are not thereby changed with regard to their order within the memory management for the stored image data, but rather merely the allocation of the respective start addresses in an allocation table associated with the document or, respectively, range is separately stored.

[0054] It is shown in FIG. 7 how a new chapter can be generated from a group of marked pages, i.e. the marked area 65 becomes a new chapter 66. For this, the menu item “Edit” is called in the main menu 36, and function “add chapter” is called. An operation window thereby appears in which the chapter name can be input, such that the chapter designation also appears in the tree structure 60 as a name.

[0055] In FIG. 8, in addition to the menu item “add chapter”, some other menu items are shown that are of importance for an efficient processing of such electronic documents. For example, it is enabled with the menu item 67 “page numbers” to automatically generate the page numbering. The connection to the print production module 16 (print job manager and order distribution system) is directly produced with the menu item 68 “Production Print”. In a print production work process (print production work flow), it is of great importance to be able to largely automatically process page ranges. For example, the precise positioning of the print data on the page is an elaborate process that can be substantially accelerated in that specific page-referenced specifications are valid for the entire document (meaning for all pages or at least for a specific region of the document, for example for chapters). Such a regional association can already ensue in the generation of the document, even though the individual document pages are respectively individually scanned. It is thereby in particular advantageous to enable the following associations:

[0056] individual pages, for example pages 1,5,9, whereby a specific association or, respectively, automatic processing ensues on the pages 1, 5 and 9 page ranges, for example pages 1-10

[0057] staggered association, for example pages 1-50: 2, meaning an operation or, respectively, association effects every second page within the range between the pages 1-50. Such an operation is, for example, advantageous in order to respectively treat the front of a page and the back of a page differently combination of the associations cited above.

[0058] In the course of the region-by-region or also page-by-page automatic processing, diverse functions can be provided such as they are, for example, specified in FIGS. 11 and 12.

[0059] It is now specified using the FIGS. 9 through 11 how image data of a first page or a first group of pages are connected with overlay data. FIG. 9 shows the logical principle in the association of overlay data with page data. The page data thereby comprise the pages P1, P2 . . . Pn and the overlay data O1, O2 . . . Om. A maximum of m overlays can thus be associated with each page, and a maximum of n pages can by associated with each overlay, whereby m,n are respectively positive whole numbers. The logical association of the pages ensues (as is depicted in FIG. 10) within the print server 2 in a reference data storage 70, whereby in turn an association to coresponding [sic] ranges or, respectively, individual pages can be made. The overlay data are separated from this in the storage 71, and the page data are filed in storage 72. It is possible via the corresponding association of reference data between the overlay data and the page data to substantially relieve the data transfer from the print server 2 to the printer 4, in that the page data, the reference data, and the overlay data are respectively transferred from the print server to the printer separated from one another. This enables each of these data sets to respectively by transferred only once, i.e. to transfer the data of an overlay respectively only once from the print server 2 to the pritter [sic] 4. In contrast to conventional methods in which each page was transferred together with respectively all associated overlays, this significantly relieves the data transfer from the print server 2 to the printer 4, because overlay data and/or page data that occur repeatedly and identically in the document must only be superimposed once. This advantage is achieved above all when n or m are respectively whole numbers greater than 2.

[0060] An overlay can thereby comprise either an image element which is positioned in a page or also entire page information. Within the reference data 70, the corresponding data for each overlay are stored, for example its size, its name, its position within a page as well as the page range or, respectively, the individual pages with which this overlay is associated. An overlay can thereby be associated with n pages, and a page can be associated with respectively m overlays.

[0061] Both the layers O1 through On and the pages P1 through Pn can mutually be stored in a single IOCA file. However, the separate storage (shown in FIG. 10) in storage ranges 71, 72 in different IOCA files is advantageous.

[0062] Within the document editor 10 (compare FIGS. 2, 3 and 4), the storage already ensues in the editing module 11, whereby the corresponding data are then transferred from the editing module 11 to the job submission module 12 before they are transmitted to the print server 2 or, respectively, to printer 4.

[0063] In FIG. 4, a selection menu window and association window 75 is shown which can be called from the editor module 11 via the push button 49 (FIG. 5). The association window 75 comprises a first region 76 in which the overlay data can be selected. In a second region 77, the positioning of the respective overlay data within the respective pages and the association to the corresponding pages or, respectively, ranges is displayed. In a zone 99, the currently selected overlay (here the overlay Nr. <5> displayed in the tree structure 80) is completely displayed. As soon as an overlay is selected, the detailed positioning of the overlay on the respective pages can be determined via the push button 81, for. example the distance of the overlay from the respective page borders, its precise position and/or an enlargement or, respectively, shrinking of the overlay information within the page.

[0064] Additionally, it is determined for each overlay with which pages it is associated. For this, all pages, all even or odd pages, or a specific page range by numbers can be specified via page selection window 82. Alternatively to this, an already existing range, for example entire chapters, can also be selected via the chapter selection region 83. A regeneration of ranges can also ensue in a region 84, and the overlays are associated with this new range. Via the association, association data are obtained via which the later, speed-optimized, one-time transmission of the overlay data and the page data is possible, as well as their page-precise composition and repositioning within the print server.

[0065] Finally, individual pages can be associated via the individual selection region 85 with the respective overlay, whereby a first inspection of the overlay superimposition can ensue in the superimposition display zone 78 for the respectively selected page.

[0066] The association is completed, and the corresponding association parameters stored, via the push button 86. These stored data are further processed in a later print job processing process by the print job manager 16, the other distribution system and/or the profession document composer 13 or, respectively, the professional document viewer, such that the correct association between overlays and pages can ensue with the corresponding systems both page by page and within the page.

[0067] An association window 90 to associate predetermined automatic processing processes for individual pages, ranges or documents is shown in FIG. 12. This window can also be directly called by the processing window 35 via the push button 51. Within the window 90, the correspondingly automated processing events can first be called in the selection field 91, such as, for example, removal of image specks (“specal removal”), the automatic numbering of pages, the automatic centering on the page (cropping), as well extending functions or rotation functions. The respective functions are graphically illustrated, or at least specified via clarifications, in the demonstration field 92.

[0068] Those pages, ranges or documents in which the respectively automated event (batch) should run are then selected in the selection region 93. As previously, adjustments can thereby be selected in the field 75 between page-by-page and region-by-region. Additionally, an association for complete documents, or also a plurality of documents, can also ensue in the zone 94. The image processing event for whole documents is hereby accelerated. New automated actions can be formed or, respectively, assembled from known actions via the push button 95. Moreover, it can thereby be provided to completely accept already assembled automatic actions (batches), inclusive of the range specifications associated with them, into a new batch to be generated.

[0069] A base window of the print job manager 16 is shown in FIG. 13 that is started via the function call 68 in FIG. 8. The substantial specifications for a print job (such as the job name, the number of the copies to be generated, the printer to be activated, and so forth) are specified in the region 101. Job attributes are specified in the field 102, and all active (i.e. jobs presently located in processing for printing) as well as all already edited jobs are listed in the field 103. The files belonging to a currently selected job are listed in the window 104.

[0070] A selection menu 110 is shown in FIG. 14 which likewise can be called from the menu item 68 (Production Print) of the FIG. 8. Various parameters can thereby be adjusted that are part of the editing of a print job belonging to a complete document, such as the imposition of the professional document composer, the print mode (simplex, duplex, tumble) as well as the inclusion if special pages such as cover pages and divider pages. Further specification for page setup (paper size, paper orientation, borders, etc.), as well as supplementary adjustment for post processing devices of the printer such as stapling device, barcode labelers and so forth, can be adjusted. Information necessary for the print job manager, such as print job name, printer name, number of the copies, and so forth can also be specified via this adjustment menu.

[0071] Exemplary embodiment of the invention were specified. It is thereby clear that the invention can at any time be combined with known functionalities. The invention can in particular be used in a print production environment and comprises devices such as computer, printer, but also control software (computer programs and their modules) or, respectively, computer program products such as, for example, files and volatile as well as non-volatile data storage that, upon loading onto a computer, effect the sequences specified above.

[0072] Although the acceptance of the raster scanned page data and overlay data as well as the referencing data or, respectively, indexing data by the document generation program was specified by an example of a print production system, this acceptance could also ensue on another document management system, and thereby lead to a high transfer speed. For example, such an acceptance in the course of the transfer of the raster scanned data in a network (LAN, WAN) by an electronic system (server, e-mail program, document management program) to a corresponding or even identical second program can be very advantageous.

[0073] To recapitulate, it is established:

[0074] The invention concerns a method and a system to process image data scanned page by page to generate an electronic structured document. The processing function modules are functionally coupled with a print production system that comprises a plurality of sub-modules (print job manager, order distribution system), such that in the generation of the document and in its forwarding to the print production system, a time-optimized work flow is achieved. A storage-optimized type of superimposition of image data ensues via logical association of overlay data with per-page data or data ranges. Furthermore, additional functions are provided, such as the dynamic display of information about individual document pages and/or of processing information for the respective document pages. A per-region processing of the data is also provided.

[0075] Reference list

[0076]1 central computer

[0077]2 print server

[0078]3 user computer

[0079]4 printer

[0080]10 document editor

[0081]11 editor module

[0082]12 job submission module

[0083]13 professional document composer

[0084]14 professional document viewer

[0085]15 scan module

[0086]16 print job manager

[0087]17 interface for user program

[0088]18 print production module

[0089]19 interface to printer

[0090]20 language converter

[0091]21 IOCA converter

[0092]22 read module

[0093]23 data input

[0094]24 data output

[0095]25 converter module

[0096]26 output interface

[0097]27 printer driver

[0098]28 print file manager

[0099]29 output converter

[0100]30 print output file

[0101]35 processing window

[0102]36 main menu

[0103]37 toolbar (push buttons)

[0104]38 toolstyle bar

[0105]39 overview window

[0106]40 layer control area

[0107]41 tree structure display area

[0108]42 thumbnail display

[0109]43 tree display area

[0110]44 detail display area

[0111]45 vertical ruler

[0112]46 horizontal ruler

[0113]47 crosshair

[0114]48 print production push button

[0115]49 overlay positioning push button

[0116]50 mouse pointer

[0117]51 range batch processing push button

[0118]52 identification symbol for the page currently shown in the window 44

[0119]60 tree structure

[0120]61 batch thumbnail

[0121]65 marker area

[0122]66 chapter

[0123]67 page number

[0124]70 reference data storage

[0125]75 overlay association window

[0126]76 overlay selection area

[0127]77 overlay positioning area

[0128]78 transfer display field

[0129]79 overlay display field

[0130]80 overlay directory

[0131]81 overlay positioning push button

[0132]82 page selection fields

[0133]83 chapter selection area

[0134]84 chapter relocation area

[0135]85 individual page selection area

[0136]86 operation push button

[0137]90 batch association window

[0138]91 batch selection field

[0139]92 batch demo field

[0140]93 batch area selection

[0141]94 batch document selection

[0142]95 push button for new batch

[0143]100 PJM base window

[0144]101 job parameter window

[0145]102 job attribute window

[0146]103 job window

[0147]104 file window

[0148]110 selection menu

[0149] O1 . . . On [sic] overlays

[0150] P1 . . . Pm [sic] pages 

1. Method for display screen-aided generation of an electronic document from raster scanned image data in an application computer (3), whereby the image data are filed in at least one electronic storage (72), a first group if image data (P1 . . . Pn) stored page-by-page that comprise a plurality of pages (main document, range), a second group of overlay image data (O1 . . . Om) in the relationship if 1:n or m:1 is associated such that one page (P1, P2, . . . ,Pn) is associated with at least one overlay (O1, O2, . . . , Om), or one overlay is associated with at least one page, and whereby n and m are respectively whole numbers greater than
 2. in a first display screen region (41), symbols (<1>, <2>, . . . ) of the first group are displayed per page in an ordered structure (60), in a second display screen region (44) image data (P1 . . . Pn) of the first group are displayed in detail, and the document is processed by means of an editor computer program (10) loaded on the application computer (3).
 2. Method according to claim 1, whereby the first group (P1 . . . Pn) is stored in a storage (71) independent of the second group (O1 . . . Om).
 3. Method according to claim 2, whereby the first group (P1 . . . Pn) is transferred independent of the second group (O1 . . . Om) from a first device (PC, 3) to a second device (2,4), and the second group (O1 . . . Om) is first superimposed with the first group (P1 . . . Pn) for a display or for a printing of each individual page.
 4. Method according to any of the preceding claims, whereby the association ensues such that the image data of both groups (P1 . . . Pn; O1 . . . Om) are connected with one another via referencing data.
 5. Method according to claim 4, whereby the overlay data and the referencing data are respectively transmitted only once to the print production control module (PJM, 16) to print the document.
 6. Method according to any of the preceding claims, whereby a mutual storage of the overlay data (O1 . . . Om) and the per-page image data (P1 . . . Pn, main document) ensues as a single IOCA document.
 7. Method according to any of the preceding claims, whereby the association ensues such that the first image data are respectively a document page, and that the second image data are an overlay (page, logo, image) with which is respectively associated position information for superimposition with the first image data.
 8. Method according to any of the preceding claims, whereby respectively one group of image data (ranges) are selected for processing, at least one attribute (chapter, color, two-up, overlay-property . . . ) is associated with the group of image data (ranges), an automatic processing process (batch, macro) is selected or assembled from a plurality of selectable processes, and the group of image data (ranges) is processed with the selected processing process (batch, macro).
 9. Method according to any of the preceding claims, whereby the attribute comprises the association with overlays and/or the association with automatic processing processes.
 10. Method according to any of the preceding claims, whereby in a first display region (41) the content of the first group of the associated document page (P1 . . . Pn) is displayed in a separate display area (42) given contact of a symbol (<1>, <2>, . . . ) with a selection means (50).
 11. Method according to claim 10, whereby attributes of the image data such as storage size, resolution are displayed in the first screen area (41).
 12. Method according to any of the preceding claims, whereby a connection is produced from a document editing module (11) effecting the method sequence to a print production control module (PJM, 16).
 13. Method according to claim 10, 11 or 12, whereby print-specific data are transferred, such as page composition data (simplex, duplex, tumble), print post-processing data (stitching, binding) and/or job submission data (job name).
 14. Method according to claim 13, whereby job submission data (job ticket) are separately indicated as such and are first processed by the print system and/or the print production control module (16) when the print event is first triggered at a print device (4).
 15. Method according to any of the claims 12 through 14, whereby the image data are supplied to a print device (4) and there printed out.
 16. Method according to any of the preceding claims, whereby the data are at least partially processed in the TIFF format.
 17. Method according to any of the preceding claims, whereby the image data can be selected per-page or per-region for full display in a second display area (44) of display means.
 18. Method according to any of the preceding claims, whereby a page sequence that is predetermined per image segment via logical attributes (page number) is associated with the image data within the document.
 19. Method according to claim 18, whereby the sequence within the group or sub-group can be changed such that only the logical association (attribute) is changed, but not the storage location in the storage (70, 71, 72).
 20. Print system with means to implement a method according to any of the claims 1 through
 19. 21. Computer with means to implement a method according to any of the claims 1 through
 19. 22. Computer program that upon loading onto a computer effects a method sequence according to any of the claims 1 through
 19. 